Combining Social Network Analysis with Semi-supervised Clustering: a case study on fraud detection
نویسندگان
چکیده
At time of crisis, when fraud permanently frightens the basis of modern societies, the existence of effective tools to prevent it, or just to identify it in time, is critical. However, the detection of fraud is naturally impaired (among other issues) by the difficulty on labelling data, due to the cost of identifying and attest fraud. Moreover, the inability to incorporate domain knowledge in the mining process makes classifiers to use inadequate attributes to distinguish entities, ignoring most of existing relevant information, like entities’ social relations. In this paper, we propose a new methodology for enriching semi-supervised clustering with information collected through the analysis of social networks. The methodology is then applied on tax fraud detection and assessed by measuring the impact of this enrichment in the accuracy of semi-supervised clustering methods.
منابع مشابه
MEFUASN: A Helpful Method to Extract Features using Analyzing Social Network for Fraud Detection
Fraud detection is one of the ways to cope with damages associated with fraudulent activities that have become common due to the rapid development of the Internet and electronic business. There is a need to propose methods to detect fraud accurately and fast. To achieve to accuracy, fraud detection methods need to consider both kind of features, features based on user level and features based o...
متن کاملOutlier Detection Using Unsupervised and Semi-Supervised Technique on High Dimensional Data
Outlier detection is useful for credit card fraud detection. Due to drastic increase in digital frauds, there is a lot of financial losses and therefore various techniques are developed for fraud detection and applied to diverse business fields. In high-dimensional data, outlier detection presents some challenges because of increment of dimensionality. In this paper, the proposed model aims to ...
متن کاملDetecting fraud in online games of chance and lotteries
Fraud detection has been an important topic of research in the data mining community for the past two decades. Supervised, semi-supervised, and unsupervised approaches to fraud detection have been proposed for the telecommunications, credit, insurance and health-care industries. We describe a novel hybrid system for detecting fraud in the highly growing lotteries and online games of chance sect...
متن کاملImproved competitive learning neural networks for network intrusion and fraud detection
In this research, we propose two new clustering algorithms, the improved competitive learning network (ICLN) and the supervised improved competitive learning network (SICLN), for fraud detection and network intrusion detection. The ICLN is an unsupervised clustering algorithm, which applies new rules to the standard competitive learning neural network (SCLN). The network neurons in the ICLN are...
متن کاملSurvey of Clustering Applications
Data mining is the process of collecting and analyzing useful patterns from huge amount of data, it has five major functions, clustering is one of them. In clustering, we make clusters of same data. The items in one group of cluster are alike while different from items which are in some other group of cluster. In image segmentation clustering approach is used for making segments of images and i...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011